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Abstract 

We establish an exact asymptotic formula for the square variation of certain partial 
sum processes. Let {Xi} be a sequence of independent, identically distributed mean zero 
random variables with finite variance a and satisfying a moment condition E < oo 

for some (5 > 0. If we let Vn denote the set of all possible partitions of the interval [N] into 
subintervals, then we have that max^^-pj^ J^ien I X^ie/^iP ^ 2a'^N\n\n{N) holds almost 
surely. This can be viewed as a variational strengthening of the law of the iterated logarithm 
and refines results of J. Qian on partial sum and empirical processes. When S = 0,we obtain 
a weaker 'in probability' version of the result. 

1 Introduction 

Let {Xi} be a sequence of independent, identically distributed random variables with mean 
fi < oo. The strong law of large numbers asserts that 

N 



1=1 

almost surely. Without loss of generality, one can assume that Xi are mean zero by considering 
Xi — /i. If we further assume a finite variance, that is E = o"^ < oo, the Hartman- 

Wintner law of the iterated logarithm [8j gives an exact error estimate for the strong law of 
large numbers. More precisely, 

2 



N 

E 

1=1 



Xi 



< {2 + o{l))a'^Nlnln{N), 



(1) 



holds a.s., where the constant 2 cannot be replaced by a smaller constant. That is, the quantity 
J2iLi-^i g^ts as large/small as ±y/ (2 — e)aN lnln{N) infinitely often (a.s.). 

The purpose of our current work is to prove a more delicate variational asymptotic that 
refines the law of the iterated logarithm and captures more subtle information about the oscil- 
lations of sums of i.i.d random variables about their expected values. More precisely, we prove 
the following theorem. We let Vn denote the set of all possible partitions of the interval [N] 
into subintervals, and we consider each tt S Vn as a collection of disjoint intervals, the union 
of which is [N]. We write / S vr to denote that the interval / C [A^] belongs to vr. 

Theorem 1. Let {Xi} be a sequence of independent, identically distributed mean zero random 
variables with variance a and satisfying E < oo for some 6 > 0. Then we have almost 

surely: 



max y 



2a^Nlnln{N). 



(2) 
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Choosing the partition vr to contain a single interval J = [1,A^] immediately recovers ([T]), 
the upper bound in the law of the iterated logarithm. 

It is unclear if the assumption E [iXip^"^] < oo can be removed. The analysis here should be 
able to be pushed further to handle a condition of the form E < oo where ip{x) is 

a positive increasing function that grows slower than for any 5 > (this requires analogous 
refinements of Lemma [9] and Lemma [TT]) . however this will not extend to the case when ip{x) is 
bounded. Without an auxiliary moment condition, we are able to establish the following weaker 
'in probability' result. 

Theorem 2. Let {X,i} he a sequence of independent, identically distributed mean zero random 
variables with finite variance a. We then have that 

max^gp^ ZjgTT I J2iei ^ 
2a'^N\nln{N) 

Here, — > denotes convergence in probability. 

For the rest of the paper, we will denote {Xi}^-^ more concisely as {Xi}^ and write 
||{Xj}^||y2 for the quantity ^maxT^g-p^v X^/stt I X^ie/ indicated by the notation, this 

expression satisfies the triangle inequality: \\{Xi + Yi}^ \\v2 < \\{Xi}^ \\v2 + \\{Yi}^ \\v2 . (This 
can be easily verified.) 

1.1 Previous Work 

In [13], Taylor proved an almost sure asymptotic for the path variation of Brownian motion. 
This is closely related to our main result in the case that Xi are normally distributed. The 
question of the asymptotic order of ||{Xj}^||y2 in the case of more general Xi is implicit in 
work of Dudley [Ij and Bretagnolle [T] , where related questions about the variation of processes 
are studied. Dudley's interest in these p- variation norms stems from the fact that they majorize 
the (more classical) sup norm, but in many cases have nicer differentiability properties. 

The most recent result we are aware of concerning our specific question appears in the work 
of J. Qian [11]. There it is shown that 

Theorem 3. Let {Xi}'?2^-^ be i.i.d mean zero random variables with variance a . Then for some 
constant c we have that P [maxTrg-Pj^ S/eTr I Sie/ — co"^ In In(A^)] — t- as N ^ oo. 

Theorem 4. Let {Xi}^-^ be i.i.d mean zero random variables with variance a andE [\Xi\'^+^] < 

oo for some 5 > 0. For some constant c' we have that 

P [max^gp^ J^ien I Eie/ ^ cV2Arinln(iV)] ^ as iV ^ oo. 

These results have been used in |10j to show that that certain variation operators which gen- 
eralize (and majorize) classical maximal operators arising in harmonic analysis are unbounded 
on certain L^ spaces. 

Notice that Qian's upper bound does not require a moment condition with 6 > 0, but her 
lower bound does. Our results improve on these by establishing the exact constants as well as 
improving convergence in probability to almost sure convergence when 6 > 0. When only the 
second moment is finite, we obtain convergence in probability results with the exact asymptotic 
constant. 

1.2 Notation 

In our proofs below, we will often fix a positive integer and use [N] to denote the set of 
integers {1, . . . , A^}. When we say / is a subinterval of [N], we mean that / = {is,ie] for some 
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real numbers < is < < A^. We denote the length of the interval / by \I\ (i.e. \I\ = ie — is)- 
When is and i^ are integers, this quantity is equal to the number of positive integers contained 
in /. When we say /' C / (/' is a subinterval of /), we mean that I' = (i'g, . . . , i'^] for some real 
numbers i'j,,ig satisfying is < i'^ < i'^ < ie- 

We will consider independent, identically distributed random variables Xi, X2, ■ ■ ■- We will 
routinely use Si to denote the partial sum Si = Xi + - - - + Xi. For an interval /, we use Sj to 
denote Sj := "^i^j Xi (i.e. the partial sum of the variables Xi whose indices i are contained 
in the interval /). We use In to denote the natural logarithm and log to denote the base 2 
logarithm. We use exp{x) as an alternate notation for e^. 

In our proofs, we will often refer to "constants" whose values depend on 5 (and only on 5). 
We will often not reflect this dependence in our notation. Throughout our proofs, 5 should be 
thought of as a fixed, positive constant. 



2 The Upper Bound 

In this section, we prove the following theorem: 

Theorem 5. We let Xi,X2, - - - denote independent, identically distributed random variables 
with E[Xi] = and Var[Xi] = 1. We further assume that E[|Xjp(-^+^)] < 00, for some 5 > 0. 
Then, for every e > 0, 

1 1 {^j}"'^ 1 1 , ,2 
limsup — — — ; — < 2(1 + e) a.s. 

^^00 A^lnlnAf " ^ ' 

The proof of this theorem will proceed in several stages. First, we will fix e > and classify 
intervals / C [N] into three disjoint categories, "good," "medium", and "bad". (This same 
strategy is used in [13] •) We say an interval / is "good" if 

5| < (2 + e)|/|lnlniV. (3) 

We say it is "medium" if 

(2 + e)|/|lnlniV < S"! < 5|/|lnlniV, (4) 

and it is "bad" if 

S] > B\I\\nh).N. (5) 

The precise value of the parameter B will be chosen later. For now, we simply think of it as a 
constant depending only on 5. 

We will deal with each class of intervals separately. We begin by considering the contribution 
of the bad intervals to the value of ||{Xj}^||^2- 



2.1 The Bad Intervals 

To suitably bound the contribution of the bad intervals, we will begin by essentially reducing 
the space of allowable partitions. We assume for simplicity that is a power of two, denoted by 
N = 2'^. We will later argue that our results extend to all positive integers N. When N = 2^, 
we consider intervals of the form 

((c - l)2\c2% i G {0, 1, . . . , n}, c G {1, . . . , 2^^-'}. 

This gives us n + 1 = log(A^) + 1 levels of intervals, where the i^^ level contains 2"~* disjoint 
intervals, each of size 2*. We now augment this family of intervals by adding "half shifts" of 
each level. More precisely, we also consider intervals of the form 

((c - 1)2* + 2'-\c2' + 2'-\ i G {1, . . . , n - 1}, c G {1, . . . , 2""^ - 1}. 
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This approximately doubles our total number of intervals. We will call the first family of 
intervals and the second family of intervals J-g. J- includes n + 1 levels of intervals, while J-s 
includes n — 1 levels of intervals. Within each family at each level, the intervals are disjoint. 
We now show: 

Lemma 6. Let I' C [N] denote an arbitrary interval. There exists some interval I ^ F U Fg 
such that r Q I and \I\ < 4|/'|. 

Proof. We define i to be the non- negative integer satisfying 2'"^ < |/'| < 2*. If i > n — 1, 
then / := [1, . . . , A^] G suffices. For i < n — 1, we consider the intervals in F U J-g of length 
2*"'"^. There are two cases: either /' is contained in some / G of size 2*+^ (and therefore, 
we are done), or /' must contain a right endpoint of some /' G of size 2*"^^. In other 
words, /' contains c2*"^^ for some c G {1, . . . , 2"~*~^ — 1}. Since |/'| < 2*, this implies that 
I' ^ (c2*+-'^ — 2*,c2*+^ + 2*). We can alternatively express this as: 

/' C ((c - 1)2^+^ + 2\ c2'+^ + 2'] G Tg. 

So in both cases, we have an / G U 7-"^ such that I' I and |/| < 4|/'|. □ 

For the purpose of bounding the contribution of bad intervals, this allows us to consider 
only intervals in J^L) J-g (to some extent). More precisely, for each interval / G U we will 
consider the random variable 

max Sf/ . 

I'CI 

If some /' C / of size |/'| > ^\I\ is bad, meaning that \Si'\'^ > Inln A^, then 

maxS?/ > — III InlnA^. 
I'CI 4 

To enable us to later consider values of N which are not powers of two, we actually consider 
"badness" with respect to N/2 instead of A''. More precisely, we let Isadj denote the indicator 
variable of the event 

maxS*?/ > — |I| InlnA^. 
I'CI ^ 8 

for a particular interval /. Here we have used the very loose bound that lnln(A^/2) > ^ Inln A^, 
for A^ > 4. Note that / can contain a subinterval of size > which is bad with respect to 
A^/2 (or anything between A^/2 and A^) only when this event occurs. 

It is then clear that the contribution of the bad intervals to the value of | 1 is upper 

bounded by: 

3 ^ ma^ Sj, ■ lBad,i ■ (6) 
I&TVJTs ~ 

To see this, note that each bad interval /' in the partition achieving the maximal value is 
contained in some / G J-UJ-g such that I_Bad,/ = 1- Since the intervals in the maximal partition 
must be disjoint, each such I will only be associated with at most 3 /''s. Thus, to control the 
contribution of the bad intervals, it suffices to prove a suitable upper bound on ^ that holds 
almost surely. As a shorthand notation, we define the variable 1/ := max//c/ Si- Then the 
quantity we need to bound can be written a bit more succintly as: 

3 ^I-^Bad,I. (7) 

leTuTs 

To bound ([7]), we will rely on several standard lemmas. 
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Lemma 7. (Etemadi's Inequality - Theorem 1 in JE]) Let Xi,X2,...,Xfc denote independent 
random variables and let a > 0. Let Si := Xi + ■ ■ ■ + Xi denote the partial sum. Then: 

P[max 15^1 > 3a] < 3 max ¥[\SA > a]. 

Lemma 8. (Doob) Let {Mi}^^^ be a submartingale taking nonnegative real values, and p > 1. 
Then: 

max Mi 
, i<e<L 



E 



- p 



p 



Lemma 9. (Rosenthal's Inequality - Thm. 3 in Let 2 < p < oo. Then there exists a 

constant Kp depending only on p, so that if Xi, . . . ,X£ are independent random variables with 
E[Xi] = for all i and E[\Xi\P] < oo for all i, then: 



(E[|5^|P])^/P < Kp max < 



i/p / / \ 1/2- 



vi=i / \i=i 



We now prove: 
Lemma 10. For any interval / E J-'U J-g, 

E[|y/|^+^] < 

where C is a constant depending only on 6 . 

Proof. For each / = {ig, ■ ■ ■ ,ie], we define the notation Sj^k := ^j^+i + • • • -^^i^+fc to denote the 
partial sum of the first k variables in the interval I (for values of k from 1 to |/|). We also define 
the random variable Yj as: 

Yt := max S? l.. 

i<k<\i\ 

We observe that E[|y/|i+'^] < 4'^+^E[\Yi\'^+^]. To see this, consider an arbitrary interval 
/' = (i'^,...,ig] C / = (ig, . . . , ig], where i'g / ig. We let ki denote the number of integers 
contained in {is,i'e] and we let k2 denote the number of integers contained in {is,i'g]. Then 
Si,ki = Sj' + Si^k2i so 

\Sr\ < 2max{|S'/,fcil, I'S'/^fcJ} . 

This implies E[|y/|^+'5] < 4i+%[|y7|i+'^]. (In fact, it implies the stronger fact that Yj < 4Yj 
always holds, and we will use this again later.) 

Now, {Sj^k} is a martingale, so {|5'/,fc|} is a submartingale (by Jensen's inequality). Thus, 
by Lemma [HI 

^ 2(1+5)" 

max \Sj k\ 
^i<fe<|/| 



E[|y7|i+'5] = E 



< C"E[|5/|2(i+^)]. (8) 



for some constant C depending only on 5. 
Applying Lemma [9l we see that: 



E[|5/p(i+^)] < K'\I\^+^, (9) 

for some constant K' depending only on 6. Combining ([9]) with ([8]) (and recalling that Yj < 4Yi), 
we have shown: 

K[\Yj\^+^] < C7|/|i+* 

for some constant C depending only on 6. □ 
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Our next goal is to derive a suitable upper bound on the quantity EfY/I^ad,/] for every 
interval I. To do this, we will need one more standard lemma. 

Lemma 11. (Berry-Esseen Theoren^) Let Xi, X2, ■ ■ ■ be independent random variables with 
E[Xi] = 0, E[Xf] = 1, and K[\Xi\'^+^] < M for all i, and some 7 € (0, 1]. Then there exists a 
universal constant such that for all positive integers k: 



sup 

-oo<x<oo 



< X 



-y"/'dt 



< a 



M 



To upper bound E[Y/I^a(i /], we begin by applying Holder's Inequality with p := 1 + 5 and 
1 I 1 



q defined so that - + - = 1. This gives us: 



E[17W,/] = E[|17W,/|] < \^[\Yi\ 
Applying Lemma [TOt we see that 

Since "i-Badj only takes values in {0, 1}, we also have 

M^Bad,lV] = ^[iBadA = A^Bad,! = !]• 

We now consider 



(E[|lBad,/n) 



'[I 



Bad,I 
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Yi > —1/1 InlniV 



We recall the definition of Y/ and the fact that Y/ < 41/ from the proof of Lemma [101 We then 
have: 



< 



8 

By Lemma [71 this quantity is 



^ 13 

Yi > —1/1 InlnA^ 
32' ' 



max \Sik\> \—\I\^^^(lnlnN)^/^ 

i<k<\i\ ' ' ' V 32' ' ^ ^ 



< 3 max ] 

1<A:<|7| 



\SjM>^\I\'/\lnlnN)y^ 



We will bound this probability using Chebyshev's inequality for values of k which are < j/]^^'^, 
and using Lemma [TTl for larger values of k. 

For k < we apply Chebyshev's inequality to obtain: 



< 



288E[|5/,fcp] 
Bill InlniV 



We note that E[|5/,fc|2] = E[S] = k (recall that Si^k is a sum of k independent random 
variables, each with mean and variance 1. Since k < |/|^^^, this gives us: 



\Sl,k\ > 



B 



12^/2 



|/|i/2(lnlniV)i/2 



< 



fi|/|V2 InlniV' 



^This can be found in 2 , p. 322, for example. 
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For |/|V2 <k< \I\, we apply Lemma [TT] to obtain: 



< 



Sik VB InlniV 
Vk I2V2 

D 



27r / VBlnlnJV 
12v^ 



for some constant D depending on 5 (we are applying the lemma with 7 = 2(5, and E[|Xjp^^^] 
is a constant). 

To bound the integral, we proceed as follows (assuming that is large enough so that 



12 - ^1 



•JB In In JV 



VB In In JV 
I2V2 



ye ^ ^"^dy = —e 



J/V2" 


00 


iylniv) 


VSlnlnJV 






12^2 



Thus, for <k<\I\, we have shown: 

'B 



a,|>^|/|V=,i,u„iv,v^ 



< 



1 / 1 D 



2tt \\nN 



+ 



j|5/2- 



We define a = min{l/2, 5/2}. Then, for some constant D' depending on 5 and for all k, we 
have: 



\SiM > ^|ir/2(lnlniV)V2 



< 



1 / 1 \ 576 D' 



2tt VlniV 



+ 



Thus, 



'^BadJ 



1]< 



1 \ 288 :iD' 

27r V IniV / ^ IIK' 



Putting everything together, we have that 



27r VlniV 



1 3D' 



i+s 



(10) 



Next, we show that the contribution of intervals / satisfying |/| > (log(A^))'^ to the quantity 



([7]) is not too large, where we define the parameter d :- 
and Kronecker's lemma. 



a i- 



For this, we will use (|T0 



Lemma 12. (Kronecker's Lemma) Let ai,a2, ... he a sequence of real numbers such that ai < 
^2 < ^3 • • • o,nd Oj — )• 00 as j ^ CO. Then if xi, X2, ■ ■ ■ is a sequence of real numbers such that 
^j=iXj/aj converges, 

k 

i=i 

We now prove: 
Lemma 13. Let m denote a positive integer. Then: 

1 



2"* lnln(2™) 



Yjlsad,! — ^ a.s. as m ^ 00, 



|/|>(log(2'"))'* 



where the indicator variable Isad I c-iT'd J-L)J-s are defined with respect to N = 2*" . 
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Proof. For each interval / E J-UJ-'s, there is a minimal value of n such that / C [2"]. We order 
our sum over the Fs according to their associated values of n (and otherwise arbitrarily): i.e. 
we first sum terms for Fs with n = 2, then with n = 3, and so on, and we only include those 
J's satisfying |/| > (log(2"'))'^. (We will ignore the very small number of terms with n = 1 
for convenience.) We let Ii, I2, 13, ■ ■ ■ denote the resulting ordered sequence of all intervals / 
which are contained in L) J-'s for some value of N (which is a power of 2) and also satisfy 
\I\ > (log(2"))'^. For each such Ij, we define 

m := 2"lnln(2"), 

where n is defined from /j as above. Since we consider going to infinity, we get an infinite 
sequence of Fs. For a fixed A^, we let 1^^ denote the final interval / C [A^] appearing in the 
infinite sequence. 

We also define a new indicator variable, lBad,i,n: which indicates the event that an interval 
/ is "bad" with respect to the value N := 2"' (where n is defined from I as above). Note that 
when Isad,! is the indicator for / being "bad" with respect to a larger A^, then lBad,i = 1 implies 
that lBad,l,n = 1 as well. We then have (for any A^ which is a power of 2): 

1 1 

^^^^^ E ^^^Bad,I < — E YljBad,I.,n- 
\I\>{^oz(N)Y 

It is a bit easier to work with the variables YilBad,i,n than the original variables Yil^ad.i^ since 
the latter depend on the value of A^, and the former do not. Hence we can now think of A^ 
going to infinity just in terms of adding more random variables to our sum, instead of needing 
to change the definition of all the random variables with each change of N . 
Thus, it suffices for us to prove that: 



lim — E ^h^Bad,h,n = a.s.. 

4=1 

By Lemma [12] (note that au — )• oo as M — )• oo), this follows if the sum 

oo ^ 

E — ^Ii^Bad,Ii,n 
1=1 

converges almost surely. This in turn follows if: 

oo ^ 

E-E[|y/,W,/„n|]<00. (11) 



_1 (^i 



For a fixed n, we will have contributions from intervals of size 2^ for values of j ranging from 
dlogn to n. Note that since we only consider values of n > 2, we will have j > 1. By (jlOp . the 
sum of the expectations IE[|17I_Bad,/,n|] for intervals |/| = 2^ with the value of n is at most: 

C'-2" 

where D" and C are constants depending only on 6. We then see that (|11|) is dominated by: 




^^Tnn ^ \\nj ~^ 2^ 



B/288 n"\^ ^ 



1 



fl2) 



n=2 j=dlogn 
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We now note that for any positive real values x,y,j, we have 

(x + < (2 maxja;, y}y' = 2^ maxjx'*', y"^} < 2'^(x'^ + y"^) 
Applying this to (fT2]l . we see it is 



< 



^ Inn ^ 

n=2 jr=cilogn 



l\ 576 (l-TTj) B'" 

n) 



(13) 



where C", D'" and c' are constants depending on 5. More specifically, a' = a [l ^ 
We split this into two pieces, and first consider the sum: 

oo ^ n ^ 

D"'yJ- y -L. 

We note that 



1+5 



In n ^ 2"':! 

n=2 j=d\ogn 



E 90-', 



1 



d(7' 



n 



j=dlog n 

for some constant K„i depending on a' . Therefore, 

00 ^ n ^ 00 



E In n E oo-'i - E „do-' - 



In n ^ 2<^'J 

n=2 j=a!log7i 



n=2 



Since d = Jr, this sum converges. 
Next we consider the sum 



°° 1 SI 

Inn ^ ' 

n=2 j=d\ogn 

Since X]"=diog„ 1 < n, it suffices to consider 

00 ^ 

El 1 s_c^_^^ 
. 71 576 I 1+4 / . 
Inn 



n=2 



At this point, we choose the value of B so that 



B 
576 



1 



l + <5 



1 > 1. 



This ensures that the sum converges, and the proof of the lemma is complete. 



□ 



To conclude our treatment of the bad intervals, we must also show that the contribution of 
intervals / with |/| < (log(A^))'^ is not too large. To do this, we return to considering a fixed 
value of N (which is a power of 2) and the indicator variables I^ad,/ are all with respect to 
this A^. For each / G J^U Ts, we define the random variable Zi := l/Isad,/- We first consider 
intervals I G of a fixed size 2* < (log(A^))'^. There are L := N ■ 2~* such intervals, and we 
denote the associated random variables Zj as Zi, . . . , Z^. We prove: 



i\l+2(5 



Lemma 14. 

L L 

where K is a constant depending only on 5 and not on i or N . 



N- 
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Proof. We define Zj := ZjI^Zj\<L- other words, Zj is the truncation of the (non-negative) 
random variable Zj at the value L. We can then write: 



L L 

3=1 i=i 
We consider 



> L 



L L 

E (^^- - + Y.{Zj-z'j- nzj] + E[Zj]) 



> L 



poo 

E[Zj] - ]K[Zj] = J F[Zj > t]dt. 



By Chebyshev's inequality, 
Inserting this into the integral, we obtain: 



|E|Zil-E[Zj]|<E||Zj|'+«] t-l- 



1+51 



6LS ' 



We now see that: 



L L 

j=i j=i 



> L 



< L¥[\Zj\ > L]+ ] 



^{Zj-E[Zj]) 



> L 



n\zj\'^'] 



Here, we have applied the union bound, the fact that the Zj^s are identically distributed, and 
that Zj — Zj = when \Zj\ < L. 

We will bound these two quantities separately. First, by applying Chebyshev's inequality, 
we have: 

LF[\Zj\ > L]< L-^E[\Zj\^+% (14) 

To bound the second quantity, we let L' := L si;^ — ^^'^ Zj := Zj — E[Zj]. By 

another application of Chebyshev's inequality, 



E 



< 



l\2 



Since the random variables Zj are independent and mean zero. 



E 



E^. 



= LE[Z]]. 



We then observe: 



E[Z|] = 2 / x¥'[\Zj\ > x\dx < 2 + 2 / x¥[\Zj\ > x]dx. 



(15) 
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We recall that Zj := Zj — E,[Zj]. Since Zj is a non-negative random variable which is < L, we 



have that \Zj\ < L as well. Therefore, ()15p becomes: 



2 + 2 / xF[\Zj\ > x]dx < 2 + 2 



m\z. 



1 1+5] 



2 + 2E[|Zjf +'^] / x-'dx 



,1+5 



= 2 + ^E[|Z,f+^] [L'-'-l). 

To put this all together and simplify our expressions, we recall that L := N ■ 2~*, where 
2* < (logiV)*^. Thus, L > A^(logiV)"'^. This is very large compared to the value of E[|Z,|i+'^], 
which is < E[| Y/l^"'"'^] for the associated interval /. Recall from Lemma [TOl that E[|Y/|^"^ ] is at 
most C|/|i+^ and |/| = 2* < (log(iV))"'. This means that for sufficiently large A^, we can loosely 
bound L' as: 



L' = L 



E[\Z, 



11+51 



6L^ 



> 



L 



We then have: 



> L' 



< 



E 




4E 

< 






{L'? 




L2 



< K'm\ZA'^+^\L~^ , 



for some constant K' depending on 5. Combining this with p4p . we have 

< K" (max{E[|Zjf +'^],E[|Zjf +^]}) U 



L L 

3=1 i=i 



> L 



where K" ■.= K' + 1. 



Now we consider the quantity E[|Zj|^+'^]. We note that: 



E[|Z, 



1 1+51 



E 



-7 11 1+5 



\z,-nz,]\ 



Since Zj is a non-negative random variable, \Zj — E[Zj]| < maxjZj, E[Zj]}. Then, 
(max{Zj,E[Zj]})^ < + (E[Zj])^ . Since g{x) = x^'^^ is a convex function on [0, 00) 
Jensen's inequality implies that (E[Zj])^^'^ < E[zj~'''^]. Therefore, 

E[|Zjf +'^] < 2E[Z]+'^] < 2E[Zj+'^]. 
Since Zj < Yj for the associated interval /, we have: 

L L 



Y^Z,-Y^E[Z, 



> L 



< 2K"E[\Yj\^+^]L'' < K\I\'+'L 



1 + 5 T -5 



for some constant K depending on 6, by Lemma [TOl Since L := A^/|/|, we can rewrite this as 
K\I\^+'^^N~\ Recalling that |/| = 2\ we have: 



5^Z,-5^E[Z,] 

j=i i=i 



> L 



□ 
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Now, we fix N and consider summing these error probabilities in Lemma [TJ] for all i such 
that 2* < (log(A^))'^. We also fix a value 5' such that < 6' < 6. Since the number of such i 
and all of the terms except are polylogarithmic in A^, we get that, for all A'^ sufficiently 
large with respect to 6 (and 6,6'): 



d log log N 
i=0 



\I\=T 



|/|=2' 



> N -2- 



(16) 



We will refer to the levels of and J-g with 2* < (log A^)'^ as the "low" levels. The left hand 
side of p6|) is an upper bound on the probability of the contribution of any low level of J- to 
the quantity ([7]) exceeding its expectation by more that N ■ 2~*. The very same argument can 
be applied to the low levels of J-s- Since we are considering only values of N which are powers 
of 2, and 



< oo, 



n=l 



We observe: 



we can apply the Borel-Cantelli Lemma to conclude that almost surely, only finitely many values 
of N will have a low level which contributes more than N ■ 2^* plus its expected contribution. 
When the contributions of all the low levels obey this bound, we have: 

d log log N 

Y Yl^Bad,I< iV-2-^+ Yl nYl^Badj]- (17) 

|/|<(logAf)'* |/|<(logAr)'* 

d log log A'" oo 

Y N -2-' < nY 2"' = 2A^. 

i=0 1=0 

We will bound the second quantity using (fTO]l . Recalling that B was fixed so that ^1 — > 
2, (HU]) implies: 

/ 1 D'" 

nYllBadA<C"\I\ (^ + ^ 

where C", D'" , d, a' are constants depending on 6, 6. We note that this is merely a restatement 
of p^ . Thus, we have: 



d log log N 



1 



Y nYjlBad,i]<2C"N Y IZTT-. + l^ 



|/|<(logAr)'^ 



1=0 



D'" 



(18) 



Next, we observe that 



d log log AT Ar/1 1 T\T\ 

2C"N Y -2_<c'"^(^"^^^) 



i=0 



In^iV 



In^iV 



for some constant C" depending on 6. Finally, we note that 



oo ^ 

V — 

/ ^ Oicr' 



< CXD. 



j=0 



Putting these results together with Lemma [THl we have proven that quantity ([7]) divided 
by A^lnlnA^ goes to zero as N goes to infinity, for A^'s which are powers of 2. To achieve a 
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result for all values of A^, we consider an N which is an arbitrary positive integer, and let N' 
the smallest power of 2 such that N < N' . Then N' < 2N, and iV'lnlniV' < SA^lnlnA^ for 
instance (for all but very small N). Next, we claim that the contribution of the bad intervals 

to I IjXj}-^! is bounded by quantity ([7]) for N'. To see this, note that any bad interval in the 
maximal partition of [N] will fall in an interval I & J-U J-g for N' (of size less than 4 times the 
size of the bad interval), and this / must have Isad,! = 1 (recall that we defined these indicator 
variables to detect "badness" with respect to N' /2 < N for any subintervals of sufficient size). 
Thus, we have proven: 

Theorem 15. For a positive integer N, we let Bn denote the contribution of intervals I such 
that S] > i?|/|lnlnA^ to the quantity ||{Xj}^||^2- Then: 

V n AT . 

— — — ; — — — )• (J a.s. as iV —)• oo. 

iV In In iV 

This concludes our treatment of the bad intervals. 
2.2 The Medium Intervals 

We first reduce to the case of bounded Xj's (i.e. \Xi\ < M for some constant M). We consider 
the truncation of an unbounded Xi, denoted by Xi := XiI\x,\<M- We define Zi := Xi — Xi. 
We then have: 

Wm^'Wl. = ||{(^-IE[X,]) + (Zi - E[Z,])}^\\% . 

By the triangle inequality for the norm, this is: < \ \{Xi - E[Zj]}^||^2 + ||{Zj - E[Zj]}^||^2 ■ 
Now, ' ^ are identically distributed random variables with expectation equal to 0, 

^VarlZi] 

variance equal to 1, and finite 2(1 + 6) moment. We can therefore apply Theorem [15] to conclude: 

\\{Zi-E[Zi]}^\\l, 

hmsup ■ — < BVar[Zi] a.s. 

AT^oo A* In In A* 

We recall that i? is a constant depending only on 5. Since IE[Xf] < cxd, we can choose M 
sufficiently large so that BVar[Zi] is much smaller than e, say < e/2 (and yar[Xj] is very 
close to 1). It then suffices to bound the contribution of the medium intervals for the variables 
x,-E[Xt] ^ _j_ g/2)iV Inln A^. These variables are bounded, identically distributed, mean 

^/Var[Xi] 

zero, variance 1, and have finite 2(1 + 5) moment. 

To control the contribution of the medium intervals for bounded variables, we will first 
choose parameters eo, ei > such that eo < e, ef > eo, and: 

(l + e)(l + eo)-'(l-ei)2 > 1. (19) 

The reason for this constraint will become clear later. We also define e' by (1 + e')^ = 1 + eo. 
We let M denote the bound on the absolute values of the Xj's. For a fixed positive integer A^, 
we let A^' = (1 + e')" denote the real number which is the smallest integral power of 1 + e' that 
is > A^. We now define a family of real intervals which we will denote by 7i. This will be similar 
to J-UJ-g, the family of intervals we considered in our analysis of the bad intervals, but now 
our interval lengths will be powers of 1 + e' instead of powers of two. 

Ti will be a union of several families of intervals, denoted Hq, . . . ,T-Lh, where /i is a function 
of e'. Hq consists of all intervals of the form: 

((c - 1)(1 + e')\c{l + e'n f G {0, 1, . . . , n}, c e Z n [1, (1 + e')"-^. 
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For j 7^ 0, Tij consists of all intervals of the form: 

{{c-l){lUy+je\l + eT\c{l + e'y+je\l + eT\i^{h...,n-l},ceZn[l,{l+er-'-l]- 

We will refer to intervals of size (1 + e)* as belonging to level i in each Tij. We set h (the 
maximum value of j) to be ^-p^ — 1. Essentially, we are taking the intervals in and shifting 
them by multiples of e'(l + e')*~^ and stopping when we would reach the next interval. We now 
prove: 

Lemma 16. Let I' C [N] denote an arbitrary interval. There exists some interval I £ H such 
that r QI and \I\ < (1 + e')Vl- 

Proof. We proceed similarly to the proof of Lemma El We define the integer i by the inequality 
(1 + e')*"^ < |/'| < (1 + e'Y- If i > n - 1, then / = (0, (1 + e')"] suffices. Otherwise, we consider 
intervals in 71 of size (1 + e')*"^^. There exist integers c, k such that the leftmost endpoint of / 
is > c(l + €'y+^ + ke'{l + e'Y and < c(l + e'y+^ + {k + l)e'(l + e'Y and c G [0, (1 + e')"-^-! - 1], 
k < Since |/'| < (1 + e')\ this implies that: 

/' C (c(l + e'y+^ + ke'il + e'y, c(l + e')*+' + (k + l)e'(l + e'y + (1 + e'y]. 

We can rewrite the containing interval as: 

/ := (c(l + e'y+^ + ke'il + e')^ (c +!)(! + e'y+^ + ke'il + e'y] G Tik- 

This / satisfies |/| = (1 + e')*+^ < (1 + e'f\I'\, since |/'| > (1 + e'y'\ 

We let A^" := (1 + e')'^^^. We wih refer to an interval I £% as "medium" if 



□ 



max 
/'c/ 
|/'|>(l+eo)-i|/| 



S], > 2(1 + e)(l + eo)~V|lnlniV". 



We note that I £ Ti can contain a subinterval I' of size \I'\ > (1 + e')^^|/| that is medium (in 
the sense of (HD) with respect to some N" < N < N' only when I satisfies this new condition for 
being "medium." We let I/, Med be the indicator variable for the event that I € T-L is "medium". 
Now, for any A'^ between A'^" and N' , the total length of the medium intervals (in the sense of 
([3])) in the maximal partition for is < Ylien l-^l ' ^i,Med- We will upper bound this quantity. 
We begin by upper bounding P[I/,Afed = l], for which we employ the following lemma. 

Lemma 17. Let {Xi} be a sequence of independent mean random variables such that \Xi\ < 
MVi. Then 



max 

KKL 



i=l 



> t 



for some absolute constant C . 



While this statement is certainly not new (all of the essential ideas are contained in Hoeffd- 
ing's paper [9]), we have been unable to locate a reference for this precise formulation, so we 
have included a proof in Appendix |XJ The key ingredient in obtaining this maximal form is 
Doob's maximal inequality for martingales. 

Corollary 18. Let {Xi\ be a sequence of independent mean random variables with Var[Xi\ = 
1 and \Xi\ < M for all i. Then 



max 

J/'|>(l+^o)''i 



\Si'\ > t 



f2(l-n)^/2 



< C'e 



for some absolute constant C . 
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Proof. We apply the union bound to conclude: 



max 

I' cm 



\Si'\ > t 



}I'\>il+eo)-^L 



< 



max \Se\ > (1 — eit) 



KL 



+ ¥[\S,,l\ >eit]. 



To see this, note that any subinterval /' C [L] of size > (1 + eg) must have its left endpoint 
be < I 1 — TT— I L < cqL. By Lemma \T7\ this is: 



l+eo 



< Cexp 



2(L + Mt(l-ei)/3) 



+ Cexp 



2{eoL + Mtei/3) 



Recalling that ef > eg, this is: 



2(L + Mtei/3eo)y 



□ 



Since our variables Xi are bounded in absolute value by M, the maximum of Sp for subin- 
tervals I' C / is always bounded as < (M|/|)^. Thus, for intervals / which are too small with 
respect to A^", / cannot possibly be medium. More specifically, I/, Med can only be equal to 1 

when |/| > j^^yI^^ Inln N" . Thus, we can assume N (and hence N") is sufficiently large so 
that applying Corollary [TS] yields: 



I, Med 



1] < Cexp {-{1 + 62) InlnN") = C'{lnN")-'^^+'^\ 



(20) 



for some positive €2- To see this, note that we are applying the corollary with the value 
t = y^2(l + e)(l + eo)-^|/| In IniV", and by ([IS]), (1 + e)(l + eo)"^(l - ei)^ > 1. We consider 
N" large enough so that L dominates the term Mtei/Seo in the denominator. Here, L is the 
number of integers in the interval /, which is asymptotically equal to |/| (the length of the real 
interval I). 

We consider intervals in level i of Tij (where i is large enough so that these intervals can be 
medium). We let kij denote the number of such intervals, and kij ~ (1 + e')""*. We note that 
the indicator random variables of these intervals I/^Med are independent, because the intervals 
are disjoint. We will let itj denote the total length of all the medium intervals in level i of Hj. 
By a Chernoff bound: 



Ii,Med>2C'hjilnN")-^^+''^ 



Jeveli 

When the event 



< exp (^-C'kij(lnN")-^^+'^^/3^ . (21) 



II/,Med<2C'%(lnAr")-(i+^2) 



leveli 



occurs, we have £ij < 2C'N'{lnN")-^'^+^'^\ We wih cah this event Eij. 

Since kij ~ (1 + e')"~* and In A^" ~ n, we can choose a constant d large enough so that: 

Y Yl exp(-C'hj{lnN")-^^^"'^/^) <Yj Y1 exp {-C" {1 + €T~'n~^''^) < 



00. 



n i<n—dlnn 



n i<.n—d\nn 
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By the Borel-Cantelli Lemma, we then have that almost surely, all of the events Eij (for 
i < n — dlnn) occur when n is sufficiently large (i.e. A^' is sufficiently large). 

To address the medium intervals in Tij for levels with i > n — dlnn, we note there are 
dlnn ~ dlnlnA^' such levels, and by (|20|) . the expected length of the medium intervals on each 
level is at most C"A^'(ln A^")"^"^"*"'^^)^ Therefore, for each such i, by Markov's inequality (for 
£3 < £2): 

P > C7'iV'(lniV'r-+-] < j^^j^- 

Now, the quantity Af^")"+g3 ~ (n iii(i+°0)i+'=3 converges when we sum over n. Hence, another 
application of the Borel-Cantelli Lemma tells us that, almost surely, for n sufficiently large we 
win have iij < C'N' (In N")-^^+'3 for alH > n - dlnn. 

Putting everything together, we have that almost surely, for sufficiently large n: 

< infl ,. N'lnN'{lnN")-^^+'^'^ + C'h{d\nn)N' {In N")-"'+'\ 

Since In A^" ~ In A^' and Inn ~ Inln A^', we see that this entire quantity is o{N). Thus, we have 
proven: 

Theorem 19. Almost surely, for sufficiently large N, the length of the intervals in the maximal 
partition for N which are medium in the sense of ^ is o{N). 

This completes our proof of Theorem O since the total contribution of the medium intervals 
is at most i3 In In A'^ times the length of the medium intervals, and the contribution of the good 
intervals is at most (2 + e)A''lnln A". 



3 The Lower Bound 

We now prove the following theorem: 

Theorem 20. We let Xi, X2, ■ ■ ■ denote independent, identically distributed random variables 
with E[Xi] = 0, Var[Xi] = 1, and E[|Xi|2(i+^)] < 00 for some 5 > 0. Then, for every e > 0, 

I I {^*}^ I 1 1/2 

liminf — — - — ; — > 2(1 — e) a.s. 
N^oo A^lnlnA^ ~ ^ ^ 

We first argue that it suffices to prove this theorem when the random variables Xi are 
bounded (i.e. \Xi\ < M for some constant M). To see why, we again consider the truncation 
of an unbounded Xi, denoted by Xi := XiI\Xi\<Mj ™d we define Zi := Xi — Xi. We fix values 
ei, €2 sufficiently small and a value of M sufficiently high such that: 



2(1 - €2)Var[Xi] - ^/2{1 + ei)Var[Zi] > ^/2{l-e). (22) 

There exists such a choice of M, ei, and 62 because E[^j^] < 00, so choosing M sufficiently large 
will make yar[Xj] sufficiently close to 1 and Var[Zi] sufficiently close to 0. 

Now, ^'"'^[■^'1 is a random variable with expectation equal to 0, variance equal to 1, and 

finite 2(1 + 6) moment. We can hence apply Theorem [5] with ei to conclude that: 

\\{Zi-E\Zi]}^\\l2 
hmsup "^ \ < 2{1 + ei)Var[Z,] a.s. (23) 

iV^-oo N In In N 
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We note that \\{Xi - E[Xi]}^ 
fact that E[Xi] = 0, so E[Xi] - 



|2 



[Xi - {Zi - E[Z,])} 



Iy2 



(Here, we have used the 



E[Zj].) Employing the triangle inequality, we have: 



N\ 



\{Zi-E[Zi]} 



(24) 



If we prove Theorem [20] for bounded variables, we can apply it to the Xi — E[Xj]'s with €2- 
Note that these are mean zero random variables with finite variance and finite 2(1 + 6) moment. 



and we can divide them by \/Var[Xi] to make them have variance equal to one. This gives us: 



lim inf ■ 



ly2 



N^oo ViV In In N 
Combining this with ([M]), (ESD, and ([22]), we have that 



> J2il-e2)Var[Xi] a.s. 



|2 

Iy2 

"n^'^' at In In 



lim inf 



> 2(1 - e) a.s.. 



Therefore, we may assume from this point forward that the Xj's are bounded in absolute value 
by a constant M. 

To prove the theorem for bounded variables Xi, we will use an inequality proven in and 
a general strategy motivated by [13j. We begin by following the approach in [7j for proving the 
lower bound portion of the law of the iterated logarithm for bounded random variables. We fix 
a value < a < 1/2 and a parameter 63 whose value will be set later. 

We consider the sequence of integers mi, 772-2, • • • defined by := s^, for some suitably large 
integer s > 1. As usual, we let Sm^ denote the sum Xi + ■ ■ ■ + X^^. We note the following 
inequality from [7], p. 282: 

Lemma 21. For every e' > 0, when s is sufficiently large with respect to e' , there exists < 
7 < 1 such that for all sufficiently large k, 



Snik-i > (1 - e')v^2(mA; - mfc_i)lnln 



ruk 



> 



1 



We fix a value of s and a value £4 such that: 



(1- £4)^1-1/5 > VT^3, 



(25) 



and s is sufficiently large with respect to €4 to apply Lemma [2TJ We consider values of which 
are powers of s, i.e. = s" for some integer n. 
We now prove: 



Lemma 22. For N = sufficiently large and for each fixed j £ [N], 



sup 

l<il<i2<N^-" 



{Sii+j - Sj) + {S; 



i2 



< 2(1 - e3)lnlnAf 



< exp{—cn 



(26) 



for some constants c, 7 > independent of n with 7 < 1. 



Proof. We consider values of ^2 which are = s'^ for n/2 < k < n(l — a) (i.e. ^2 = ruk)- For each 
such k, Lemma implies that: 



^j+rrik-i > (1 - e4)\/2(mfe - mfc_i)lnlnmfc 



1 

> — . 

- k^ 



(27) 
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Now, we suppose that for some n/2 < k < n(l — a), we have 



S. 



Sj+nik-i > (1 - e4)v^2(mfe - mfe_i)lnln 



We will call this event E^, and we denote its complement by Ek- When E^ occurs, we consider 
rrik and ii := nik-i- Note that 12 < iV^~". Then: 



«2 : 



{Si^+j - Sjf + (Sia+j - S'ii+j)^ > 2(1 - eif^ruk - mfc^i) Inlnmfe. 



Using (f25]l . mfc — m^-i = s''{l — 1/s), and > n/2, this is > 2(1 — e3)s'^lnlns'^. In fact, 
since Inln = In Ins" and Inlns"/^ = In Ins" + In 1/2), for sufficiently large N we have: 



Sjf + (5,,+j - Si.,+jf > 2(1 - £3)^2 InlniV. 



Therefore, the probability on the left hand side of ()26p is at most the probability that the 
event -E^ fails to occur for all n/2 < k < n(l — a). Observe that these events for different /c's 
are independent, because they involve disjoint sets of the variables Xi. Thus, by Lemma [2T\ 



P [EkVk E [n/2, (1 - a)n]] < J] 

k=\n/2'] 



1 



1 



[n{l-a)\ 
k=\n/2] 



since for positive real numbers x, 1 — x < e ^ . This can be rewritten as: 

Ln(l-a)J 

exp 




We observe that 



Ln(l-a)J 

E 



1 ^ n(l - a - 1/2) - 2 
ky ~ nT(l - a)T ■ 



k=\n/2] 

Thus, we can choose a constant c independent of n such that 

Ln(l-a)J 



Therefore, we have shown that: 



sup 

l<n<i2<Afi-" 



k=\n/2] 



Sj) + {Si2+j 



1 



> cn 



1-7 



Sii+j) 



«2 



< 2(1 - e3)lnlniV 



< exp{—cn 



l-7\ 



□ 



Of course, we need to address all values of N and not just those that are powers of s. To allow 
us to handle general values of N, we introduce the following the family of intervals, which we 
denote by C We let C denote a sufficiently large constant which is a power of s (just how large 
it should be will be determined later). We define £ as a union of C different interval families, 
denoted by Cq, . . . , Cc-i- ^0 consists of all intervals of the form: (l+s+- • -+5^"^, l+s+- • -+3^], 
where k is a non-negative integer. We note that these intervals are disjoint, and that for each 
k, there is exactly one interval of size s^. More generally, Ci consists of all intervals of the form: 



1 + S + 



+ s + ■■■ + + 



C 



C 



18 



(for k^s large enough so that C divides s^~^^ when i ^ 0). Each Ci is a union of disjoint intervals, 
one of size s'' for each (large enough) k. To visualize these intervals, first consider Cq as an 
infinite stretch of intervals, starting with one of size 1, then one of size s, then one of size s^, 
and so on, each beginning just where the previous one left off. Now imagine dividing each of 
the intervals in Cq into C pieces of equal size. Then £i can be thought of as a copy of Cq where 
the intervals have been shifted so that they now end at what used to be the end of the first 
piece of the next interval. In £2, they are shifted so that they end at what used to the end of 
the second piece, and so on. Note that these shifts will cause (relatively small) gaps between 
the intervals in Ci for i > 0. 

We let / denote an interval in C. For each j £ I, we let Aj denote the event whose probability 
is bounded in ([26]) and Aj denote its complement. We let Pj denote the set of points j G I for 
which Aj occurs. We let Ej denote the event that |P/| > e3|/|. We now show: 

Lemma 23. Almost surely, only finitely many of the events Ej occur (for I £ C). 
Proof. We consider an / G £ of size . Then, for each j € /, 

F[i G Pi] < exp{-ck^~^), 
by Lemma [22l Hence, K[Pj] < exp{—ck^^^) . By Markov's inequality, 

P[^/] < < -exp{-ck^-^). 

£35 £3 

We note that for each k, there are at most C intervals C of size s^. Therefore: 

„ 00 

^AEi] < — ^exp{-ck^-"') <oo. 

Hence, by the Borel-Cantelli Lemma, with probability one only finitely many of the events Ej 
occur. □ 

Finally, we will prove Theorem [20] by considering a sufficiently large N and defining the 
following partition of [N]. We begin by picking out a useful set of disjoint intervals in C. We 
will denote this set by S, and we consider it initially to be empty. There is a unique positive 
integer uq such that 1 + s + • • • + s*^" <A^<l + s + -- - + s"""^-*^. There also exists an iq such 
that: 

< AT _ ( 1 + + . . . + + !0£^ ^ 



c J - c 

We add to S the interval of size s"" in C which ends at 1 + s H + s"" + and we call 

this interval Ii. Now, there exists some ni and ii such that the left endpoint of Ii falls in the 
range: 

1 + S + • • • + + , 1 + S + • • • + s"i + 

We then define I2 to be the interval of size s"''^ in C which ends at 1 + s + • • • + s"''^ + ^^^^7— • 
We then consider the left endpoint of I2, and we define I3 analogously, continuing on until we 
reach a point where the next interval we would like to use does not exist. 

When we are finished, 5 is a finite set of disjoint intervals covering most of the length from 
1 to A^. To see this, note that each time we add an interval of size s"*^, we insert a gap of size 
at most s"*^^/C. For C chosen sufficiently large with respect to s, the gap will be only a small 
proportion of the length of the interval being added. Thus, we lose only a small fraction of 
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the length of N to these gaps. Additionally, we can afford to ignore the length from 1 up to 
1 + s + • • • + s""/^, since this is o(s"°) and hence o{N). Thus, at least a length of 

iV (l - ^) - (1 + s + • • • + s'^o/') (28) 

is contained in intervals in C of size at least s'^^^'^ . 

We now define our partition of A'^, choosing our endpoints iteratively. We start at 0. (Our 
current position will always be the last element of the interval we just added to the partition, 
i.e. the last endpoint we choose.) When we are in a gap between intervals in S, we choose the 
next endpoint in our partition to be the end of the gap and we move to the first point of the 
next interval in S. When we are inside an interval in S, we let j denote our current position, 
and sizcj denote the size of the interval in S. If the event A.j occurs, we add an interval of 
size 1 to our partition and move to j + 1. If Aj occurs and j + size^'" is still within the 
current interval of S, we choose the ii and 12 which maximize (Sj^+j — Sj)'^ + [Si^+j — Si^+j)'^ 
for 1 < ii < ^2 < sizej~" as the next two endpoints and move to the point 12 + j- If j + size^~°' 
lies outside the current interval of 5, we choose the next endpoint to the be the beginning of 
the next interval in S. We continue in this way until we reach A^. 

Now we must prove a lower bound (almost surely) for the sum of the squared partial sums 
over the intervals in this partition. We will ignore the gaps (which contribute an amount > 0), 
and consider the contributions of the partition pieces which lie inside intervals of S. Almost 
surely, only finitely many of the intervals I va. S have > e3|/|. We assume that A^ is large 
enough so that all of the intervals I G 5 of size at least s"o/2 have < e3|/|. Thus, the pieces 
of the partition in each of these intervals I contributes at least 

2(1- £3)2(111 -|/|i-") In In |/| 

to the sum of the squared partial sums over the partition. For |/| > s^o/^^ lnln|/| ~ InlnA^. 
Since these intervals cover a length that is at least ()28p . we can choose €3 small enough with 
respect to e and C large enough with respect to s to obtain a contribution that is > (2 — 
e)A^lnln A^. This completes our proof of Theorem 1201 



4 A Result for Convergence in Probability 



We now prove the exact asymptotic holds for convergence in probability when the variables Xi 
only satisfy yar[A'j] = 1 and not any higher moment condition. 

Theorem 24. We let Xi, X2, ■ ■ ■ denote independent, identically distributed random variables 
with mean zero and variance equal to one. Then for every e,5 > 0, 



\{Xi}^\\l, > 2(l + e)A^lnlnA^ 



< 6 



< 6 



for all sufficiently large N, and similarly, 

F \\\{Xi}^\\y2 < 2(1 - e)A^lnlnA^ 

for all sufficiently large N. 

Proof. We will use the results of |11) along with truncation, writing Xi = Xi + Zi, where X 



I ■ — 



\Xi\<M ^-nd M is chosen sufficiently large so that yar[Aj] is close to 1, and EfZ^- 
where c' is the value from Theorem H] and ei, £2 > are chosen so that ^/ei + vT+~e2 < \/r+~e. 
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Then, applying Theorem U] to the mean zero variables Zj — E[Zj], we have (for all sufficiently 
large A^): 



|{Z,-E[Z,]} 



iV||2 
Iy2 



> ei(2iVlnlniV) 



5 

<2- 



By Egorov's Theorem, convergence almost surely implies convergence in probability, so we can 
apply Theorem [1] to the bounded variables Xi — E[Xj] to obtain (for all sufficiently large N): 



\{Xi-K[X,]}^\\l, > (l + e2)2iVlnlniV 



< 



Since -^/el + \/l + €2 < \/T+^, the triangle inequality implies that: 



\{Xi} 



> 2(1 + e)A^lnlniV 



< 6. 



The second bound can be proven by an analogous argument applying Theorem[T]to Xi, Theorem 



mto Zi, and employing 



— ^r— nAr||2 



1^2 



> \ \{X,-E[Xi]} 



ly2 



\{Zi-E[Zi]} 



N\ 



ly2- 



□ 
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A Proof of Lemma [TT 

We first note Doob's inequality: 

Lemma 25. (Doob's Inequality ^Sj) Let {Mi}f^^ he a suhmartingale taking non-negative real 
values. Then 

E [Ml] 



sup Ml > t 

o<e<L 



< 



t 



We now prove Lemma [T71 Let g{y) = 2 ^ip = ^C*^" j v) _ Nqw 



(29) 



E 



i=l i=l 



./c=0 



k\ 



He 

i=l 



l + h{X,) + -h^Xfg{hX,) 



< n 1 + hE[Xi] + -h^E[Xf]g{hM) 



i=l 



< e''T:tinx,]+ih^T:tim!]g{hM) ^ g|/»2g(hM)Ef=iE[x| 



We have: 



^Xi>t 



.1=1 



and, more generally: 



max > Xi > t 
KKL ^ 



1=1 



max e 

1<^<L 



Since M^ := forms a submartingale sequence, Doob's inequality yields: 



max 

\<1<L 



Y,X,>t 



i=l 



Using the fact that g{y) < i_^y/^ for y < 3 (which follows from the Taylor expansion given 
above), we have 



max 

l<i<L 



Y,Xi>t 



i=l 



< e 



■ht 



when hM < 3. 

Taking h = ^ EX^+Mt/3 ('^'^i^h satisfies HM < 3), we have: 



exp 



1 hM 



ht 



22 



exp 



(Eii + Mi/3) Eii Ef=i ] + Mi/3^ 



E^tiE[X|] + Mi/3^ 



This establishes that 



max > Xi > t 
i<e<L^ 



1=1 



< gEf=iE[X|i]+Mt/3_ 



Applying this result to the random variables {Yi} where Yi = —Xi, we have 



max 

l<i<L 



Y^Xi < -t 



Combining these, we obtain the desired estimate 



max 

KKL 



1=1 



> t 



< 2eS 



L ^Z[xf]+Mt/3 
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